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Figure 2A. 



1 GTTCGACCAA ACGGCTTGTT GTGCGGTGAA ACATAGCACT CCTTGTGGCG TGGCTTTAGA TGATGATATT TTGCAAGCGT 
» F5 » CTTAAGCTTGGA» F13 » 



Hindi XI 

81 ACCAAAAAGC ACACGACTGC GACCCGATTT CGATTTTTGG TGGCATTGTA ACTTTTAATA AAAAAGTAAC AAAAGCAGTG 

161 GCAGAAAAAT GTAACGAGAT TTTCCTTGAA ATCGTTGCTG CACCGAGCTT TGAGCCAGAG GCTTTGGAAG TTTTTGCTAA 

241 AAAGAAAAAT TTGCGCGTGA TTGAAGTTAA AAATCCATTA AG CGATAAAA TGCAACTCGT GCAAGTAGAT GGCGGATTGC 

321 TCGTGCAAGA AAT CGAC AAA TCGTTTAGCA ATGATTTTAA AGTAGTAACC GAAAAACAAC CTACCGAAAA GCAACTTTCT 

401 GATTTGGAAT TTGCCATGAA AGTAGTGAAA CAT GT AAAG A GCAATGCCAT CGTGGTTGCC ACAAACGGAC AAGCTCTAGG 

481 CGTGGGCACA GGCGAGACTA ATCGTATTTG GGCAGCACAG CAGGCGATTC AGCGTGCAAA GGAAAAAACA CAAGAAAATC 

561 TAGTTTTGGC TTCCGATGCC TTTTTCCCAT TCAGAGATGT GGTAGATTAT GCAGCACAAG AAGGCATTAC AGCCTTGATT 

641 CACCCAGGAG GAAGCATGCG CGACCAAGAG AGCATAGACG CGGCTAATGA AC ACGGAAT C CCGATGATCA TCAGCGGTAT 

721 GAGACAT TTC T T AC AT T AAA TCAAAAAATC TAAACAATAA TTATCAATAA TTCTAAAACA CAATAAGTAT GAATGCAAAT 

». . .purD. . .> 

801 GATTACAAAA AAAT ACT C AT C GTAG GAAAC GGCGCAAGAG AAC AC GC CAT CGGGTGGAAA ATTAAACAAG ACCACCCTTC 
> purD , > 

881 TTGCGAGCTT TTCTTTGCGC CAGGAAACGC TGGAACCGAA CAAATTGGAA AAAAC AT C GT AGCTGAATCT AATTATGGCT 

> . purD > 

« OE-R «AGATCTGGCGCTACGCTAGAAG 



Bglll 

961 TAATGCTTTT TGCTCAACAA AATGATATAG ACTTAACGAT TGTAGGTCCA GAAGCAGAAT TGGTAGAAGG TATTGTAGAC 
> . . purD . .> 

104'1 TTGTTTGAAT CCAATCAATT AAGAATTTTT GGTCCAGATA AGCGTGCGGC TAAATTGGAA GGCAGCAAGG CTTTTGCCAA 
>. . . . . ,purD ....... > 

1121 AG ATT T TAT G GAGAAATACG GCGTGCGCAC GGCTTTTGCC AAAAGTTTCA ACAATTTTGT AGACGCTAGA GATTATGTAA 
>...... . ^ . . . . . . purD . . . ....... . > 

1201 AAGAGCTCAC GCAATTCCCT ATCGTGATCA AAGCCAGTGG CTTGGCAGCA GGAAAAGGTG TGATCATCGT GCACNTACAA 
> . purD . > 

1281 CTTGAAGCCG AAACTACTTT GCGCAAAATC AT G G AAGAC A AAACCTTTGG CGAAGCAGGC AACGAGGTCG TAATCGAGGA 
> purD > 

1361 ATACTTAAAA GGTGTGGAAG TTTCTGTGCT TTCTATCTTT AACCATAAAG AAAT T AAAAC TTTCTTGCCT GTAAAAGACC 
> purD > 

1441 ACAAGAAAAT CGGAAAAGGC GAAACAGGAC TCAACACGGG CGGAATGGGC GTAGTGGCTC CTAACCCGCA TTTTACCGAT 
> purD > 

1521 GAGCACATGA AGGAGTTTGA GAAAAACATT TTGCTCCCAA CACAAAAAGG GCTCTTGGCA GAAAAAATGC ATTTTGCAGG 
> purD > 

1601 CAT TAT T TTC TTTGGGCTTA TGATTACCGA GCATGGTATT TATCTATTGG AATACAACAT GCGATTTGGC GACCCAGAAA 
>.. purD > 

1681 CCGAAG C ACT TTTGCCTTTG ATGGAGAATG ATTTAGTAGC CCTCATCGAT TCCGCAATAC ACCAGCAAGA CATTGAACTT 
> - purD , > 

1761 AAATGGAAAA ACGAACATGC TTGCTGTGTA GTAATGGCGA GCGGTGGCTA CCCAGGCACT TACGAAACTG GTTTTGAAAT 
> purD > 
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1841 CCGAGGATTG AACAAAGTTG ATGTTCCCGT ATTTATTGCA GGAGCCAGAG AAGAAAGTGG AAAAATCTAC ACCACAGGCG 
> , .purD, ...> 

1921 GGCGCGTGCT CAATGTGGTG GGAACTGGCG CTACGCTAGA AGAAGC C AG A AAAGTGGCTT ACGAAAATAT CCATAAAATC 

> . „ purD . . . > 

GAGATCTGG» OE-F » 

Bglll 

2001 AATTTTGATT ATGAATATTA TCGCGAAGAC ATCGGGAAGA TATAATCTCG CTGATTTTTA ACCAAAACAT ATTTAAAAAC 
> purD » 

2081 GCTTTTGTTA CTTTTATAAA CAAAGGCGTT TTTCTATTTT TGTGCCACTA TAAC AT GATT TAACCCATGA AAAAAATACT 

2161 AAAAAT AC T C ATTTTTCTAC TGCTCATTCC TTGGGTTTAT GCCCTGATTT TAATCTTTAT AAATCCACCT ATCACCATTA 

2241 CACAGCTGAG CAATTTATCT TATGGTTTCT CCAGAACACA GCTCGCTTAT GATGAAATTC CGGCTAGTGC TAAATGGGCT 

2321 GTAATTGCAG CAGAAGACCA GAATTTTGCC ATTCATAATG GCTTTGATTT TAAAGAAATT AAAACCGCCT ACGAGAAAAA 

'2401 CAAAGCGGGC AAGAAATTGC GTGGCGGGAG CACCCTTTCG CAACAAACTG C C AAAAAT GT ATTTTTGTGG CAAGGGCGCA 

2481 CTTGGATTAG AAAAGGATTG GAAACCTACT GC AC C T T TAT CATCGAAACG CTGTGGAGCA AGGAGC GTAT TTTGCAAGTT 

2561 TACCTCAACA ATGCCGAAAT GGGCAAAGGC GTTTATGGCA TAGAGGCAGC GGCGCAATAT TAT TTTAAGA AAAACGCCTC 

2641 ACAGCTCACG CCTACCGAGA CGGCACGCAT CATTGCCTGC CTGCCCAATC CCAAAAAATA CAATNTAAAC CCGCCAAGTG 

2721 CCTACATCTC AAAACGCGGA CAATGGATTC TGCGCCAAGT GCGAAACTTG AAAGGCGATA GGGCTCTGAG CGAGATTGTG 

2801 AACACGCCCT AACGCCTGCC TCAACTCTTT GCACACAGTT TACCAACTCT CTGCGAAGAG TTCACAAACT CTTCGCACAC 

2881 ACTTCCCCAA GTCTTTGCAA AGAGTTGGGA GATACTTAGG CACAAAAAAA AGGAACCTCA TGAATAGAGG TTCCCTCTTC 

2961 CTTAAAAGGA ATAAATAATA ATGTTTTTTA AGCTTTAGGC TTGGCTACTT TTTCAAAGCC TGCTGCCTTC ATGCTATCTA 

Hindlll 

3041 GGATACGCTT GCCTGGGCGG TAGTTTACGC CTACCTTTTT GAT TAAGCCC GAATGAAAAT CTTTCTCTGT ATCTGCCGCT 

3121 CCACTGCTTA AAGTGGCATA GAGC GAG CCA AGCTTATCTA AACGAACGAT TTTGCCCGCT GCCAAGGCGT CTTGAATTAC 
<R8 . «AAGCTTAAG 

Hindll I HindU I 

3201 ATTCTCTAGC GCAATGATAA CGCCACGAAT ATCTGCCTCG CTGAGTGCCG AAAACTTCTC GATTTGCTTA ACGAGCTGGT 

3281 CTATAT CC AT TTCTCCATCG CTTGCCACCA CGGCATAGTA TTTTTGTGGC TCCCCTGGCT TGCTTGGGTT TCTACGCTGA 

3361 ATTACATTGT ATTTTATGCT CATAATTACT CTATTTTTAA TAGCCTCCCG ATGGATATAA AGTTACGCTA CAATTAGGGT 

3441 CTCCATAAGC AAATC TATAC CCCTCTCTTT CATATTCCCT TCTCATTCTT CTTGCTCCAT CTCTCAAGGC ATCCGCTCTA 

3521 TTACTGCTAT ACCCCTCCTG AAGAAATGTG TCTGCACTTG AAGAAGAATA TGAAGAGCTA TGAGAATCGT GCAACATAGT 

3601 CCAAGCTCCA TCTTGAGCTA TAACATTTGC ATGACATGTA ACACCTATAG TATAATAAAA TCTCCTAGGA GGTTGTGTTC 

3681 CACCACCACC TCCAGAGCTA CTACTTTTTT TACATTGTCC ATTTTGGTTA GCATGATTTT GTCCGCCATC AC T TAG TAAC 

3761 TTCTTAGCTT CTGCTAAGGC TTTTTCTCTT GCTTTCTTTT CAGCATCTGC TTGGCTAATT CCACTCACTG CTGTAGCTGT 

3841 CGCTTCTTTT TTATAGTTTA CCGAGGTTCC ATAATAGCCA CTACTACAAT TGTTTCTTGT AAAGTTTTTA TTAAAAGATT 

3921 GAGTTTGTGT TGAGGTGTAC CCTCCGAAAC CTTTTACTTC TACAGTAAAG GTAGAACTCC CCATGCTTAC GGGGAAGGTG 

4001 GCGATAGTAT ACGATTGCCC TGCCGGCATT TGTTTTACTT GATACACTCC ATCTCCTCCC ACTTCTATGC TTGCCGTTAA 
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4081 AT T AC C ACTA CCGCTAAAAG AGCCTTCTGC TATTTTTAGT GTTAAATCAT TTATATCCCC TCCTTGTCCT TTTGCAGAAG 
4161 CTTTTGTTAC ACTTACAGCA TCATAAGCTC CTTTTCCATT GGTATAAGGT ATTTATATGG CCAAAC 
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Figure 2B. 



1 TAAAGCTGTA AWTCGCTATA AACGCCCTTT AG G AT AAAAT CTGCCATTTT TTGCAGTATT TTWATAGCTA AAATTTAGAA 

» FrecAOBl » 

81 AACACCATCT CGAGTAAAGG AGCGTGTAGT GCTCGCCATC GTTGAGCGAT TGCCCACCCT CAATTGATTT GGGCGAATAC 

CTTAAGCTT» F6 » 



Hindi! J 

161 TTGAAATAAA TGGCATCTTC TAGCGACACA TTTTGCGCAG AAATCATGCA AAAAGCCCCG CATAAAAAGC TGAATAAAAA 

241 WGCTAWTYTT CTTGTTTAAA AAAACTCATA AATTCCCCCA AATATAGAAA TATTCTGTGA AAAGTTGCAA T T TATTAAC A 

« . . . < 

321 CTATGTGCTT GCTTTTAATG AAAAAAGTAG ATTATTTTTC CGAATCCGAA AGTTTATTTA CGCCCCATCC GATGCCTAGT 
< . . Free A- 4 . * . « 

401 CCCMSCGATA GCCATGATTA ATACAAATAC AAT TAAATC A WATTTTTCMC MTWWACCATA GCACAACACT TGCTAGCTCA 

481 ACGAGTACTA GAGTGGTAAA AAGGATTTTT TGACGATTAT TCATGATTTT ATTTTTCTCA AAGGTAAATA TTTTAAACCA 

561 TAATTTCACA AATCTTAAAA TCTATTTAAA TAATAGAGAA AC C AGAAAAA AATCGTATTT TTACGGAATG AATAAAAT GT 

641 TACAAGTAGG CGATAAAATG CCCGATTTCA AAGGTGTAGA CCAATTTGGG AAGGAGCATT CATCTGCCGA TTTCAAAAAT 

721 C AG AAAT TAG TCGTTTTTTT CTACCCAAAA GCCAGTACGC CAGGTTGCAC GGCAGAGGCT TGCAACATCA ACGATAATCT 

801 TGATGCGCTA AAAGCACAAG GCTACCAAGT GATAGGCGTG AGTGCAGATT CGGTAGAAAA ACAACGAAAA TTCAGTGATA 

881 AATAC GAT T T TAAATTCCCT GTGATTGCCG ATGTGGATAA G AAAAT TAT T GAAGCATTTG GCGTGTGGGG CGAAAAGAAA 

961 TTCATGGGTA AAACCTATGA CGGAATTCAT CGTACGACAT TCATTATTGA TGAAAACGGA GTGGTGGAGC GCGTGATAGA 

******* F7 ******* 



EcoRI 

1041 AAAAGTGAAA AC AAAAG AT C AT AC C AAT C A AAT T T T AAAT T C AGAAAAAT AAAAATATGA GCGAAATAGA CGAAGCGAAA 

» * . . .jrecA > 

1121 AGGAAAGCAC TCCAGCTAGT GCTTGATAAA AT GG A.C AAAA GCTATGGTAA AGGTGCCGTG ATGATGATGG GCGACAAAGC 

> recA * > 

« OER1 < 

1201 CATAGACGAA AATATTCCAG TAATCCCTAC GGGGTCTCTA GGTTTAGATT TAGCCTTGGG CGTGGGAGGG TATCCGCGCG 

> recA > 

<CGAGATCTCGTGCGTGCGGT 



BglXI 



1281 GTAGAATCGT GGAGATTTAC GGTCCAGAAT CTTCTGGTAA AACCACTTTG GC AAT T CAT G CCATTGCCGA AG CTC AAAAG 
> ,..*...*... recA. * »..,* > 



1361 


TCTGGCGGAA 


TTGCAGCTTT 


CATCGATGCA 


GAGCACGCAT TTGATAGATA 


TTAC GCAGAA 


AAATTAGGCG 


TAGATGTTGA 


1441 


G CAT T TAATT 


ATCTCTCAGC 


C AGATAAT G G 


GGAGCAAGCT T TAG AAAT TG 


CCGATAACTT 


AATCCGTTCA 


GGTGCAATTG 
> 










HindllX 








1521 


ATATTATTGT 


AATCGATTCG 


GTAGCGGCTT 


TAACGCCAAA GTCGGAAATC 


GACGGAGATA 


TGGGCGATTC 


CAAAATGGGA 
**.*.*> 


1601 


TTGCAAGCGC 


GTTTGATGTC 


TCAAGCCTTG 


AGAAAGCTCA CGGGAACTAT 


CAATAAAACC 


AAATGTACTG 


CTATTTTCAT 
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1681 CAACCAATTG AGAGAGAAAA TCGGTGTGAT GTTCGGTAGT CCAGAAACCA CAACGGGTGG TAATGCACTT AAATTCTATG 
> ......... r&cA , . > 

1761 CATCGGTGCG TCTAGACATT CGTCGTTCTA CTCAGATTAA AGATGGGAAC GATGTCATCG GAAACTTGAC TCGCGTAAAA 

........... recA ...... .......> 

1841 GTAGT GAAAA ACAAAGTAGC TCCGCCATTC CGTAGTGCAG AATTCGACAT TATGTATGGC GAAGGAATCT CTAAAGCAGG 
>... recA , » > 



EcoRI 

1921 CGAGATTTTA GACATTGCTA CCGATTTAGA AATCGTGAAA AAAAGTGGCT CTTGGTATTC TTATGCAGAT ACTAAACTAG 
> recA - > 

2001 GACAAGGGCG AGATGCCGTG CGTGCGGTAT TGAAAGATAA TCCAGAATTA GCCGAAGAAT TAGAAGAGAA AATTAAAGAA 

>.... . . . , , recA , . . . > 

CGAGATCT» OEFl » 



Bglll 

2081 GAAT TAGAGA AAAAATAGAT TTTTTAGTTT TTTTAATTAA AC G AAAAAT C CGTTCACTTT GTTGAACGGA TTTTTTTATG 

>..... .recA » 

2161 CTTGAATGAA TTTATTTCCA ATGGATTGAA TAGCCATGCA CTTTTAAATC TTCGCTATCA TAAGTGATTT CTTTGTCGGT 

2241 GTTGGGATAG CAAACTTTAA GTCCTGCGTA TTTGGCAATG GCATGTCCTG CGGCAATGTC CCAAAAGTTT ACAGGTCTAA 

2321 AGCGGGTGTA CTCCGTAGCC CACCGATCGG CAATTAGCCC AAGTTTGATA ACGCTTCCCA TAGGCTTTGT GCGGAAAATT 

2401 TCATGTTCGG ATTTAATTTT TTTGATGTAT TCCTCGGTGC CAGGATCCAT GTGGAATTTG CTACAAAGAA AAGTGTAATC 

2481 TTCGGGCAAA TCCATGGTAG GAATTGGCTT GCTGTGTTTC ATCAATTGTT CAAAAAAATC CGATTT CAGA GCCATTTTGT 

2561 GCAATTGTTG TTGAGTCCCG ATGAATTTAC GAGAAGGGCA TTTATCGCTA CCGAAATAGA ACAATCCAAG CGATGGGGCG 

2641 TACAAAACTC CTAGCTTAGC CGTATTATTC TCAACTAAGC C TAGAC AC AC GCAATATTCA TCTGTTTTGT TGACAAAATC 

2721 CATGGTGCCA TCAATAGGGT CTGCAATCCA ATAGGTGGGC GTATTTCTAA TTTCTTGTAA AGAATCCTTA TCTCCTTCCT 

2801 CACTAAAGTA TGGAATGTCT GTAAAG GAAA CATGTTTTTG CAAGATTTTG TTGGCGGCTA AATCTGCACT TGTAACAGGC 

2881 GATCCGTCGG CTTTGGTCTC GGTGGAGAAT CCGTTTTGGA TTGTTTTAAA ACCTCTTCGC CAGCAAGTGC TACAGCCCGT 

2961 GTTGCGATTT CTAATAAATT C ATAAT CAT T CTTTTATTCT CGAACAAAGT CAAATAATTC TCTGTATTAA AAAATAATTT 

3041 TGGCGATAAA AATTAAAATT TATATATAAA ATATCTCTGC AAAAAACCAA ATCAAATATT TAGTGAAATA AAAAAAATTA 

3121 GATTG TAAAT TTGCCTTATG TTTT TAGAGA AT AC C ATAAA TCATAGAAAA AATACGGGCT GGATCGAAGT AATCTGTGGC 

3201 TCTATGTTTT CGGGCAAAAC CGAAGAGTTG ATTCGTAGAG TGAAACGAGC CGAATTGGCT GGGCAAAAGG TAGAAATCTT 

« R5 «AAGCTTAAG 

Hindlll 

3281 TAAACCCGCA AT TGATAAAC GCTACGATGA GCAAGATGTG GTATCGCATG ATGAAAACAA AAAACAAGCA ACCCCGATTG 

3361 AGGCGAGTTC TAACTTGCCC AT TT TAG C AA GCGATTGTGA TGTGGTGGGG ATAGATGAGG CTCAATTCTT TGACGAAGGA 

3441 ATTGTTGAGG TGGCAAATCT TTTAGCTAAT TCGGGGAAAA GAATAAT TAT TGCGGGATTA GACATGGATT TTAAAGGTCG 

« RrecAORl « 

3521 TCCATTTGGT CCTATGCCAA ATTTAATGGC GGTAGCGGAA TATGTGACCA AAGTGCATGC AATCTGTGTG AAAACAGGGA 
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